智能论文笔记

FedTADBench: Federated Time-Series Anomaly Detection Benchmark

Fanxing Liu , Cheng Zeng , Le Zhang , Yingjie Zhou , Qing Mu , Yanru Zhang , Ling Zhang , Ce Zhu

分类：机器学习

2022-12-19

Time series anomaly detection strives to uncover potential abnormal behaviors and patterns from temporal data, and has fundamental significance in diverse application scenarios. Constructing an effective detection model usually requires adequate training data stored in a centralized manner, however, this requirement sometimes could not be satisfied in realistic scenarios. As a prevailing approach to address the above problem, federated learning has demonstrated its power to cooperate with the distributed data available while protecting the privacy of data providers. However, it is still unclear that how existing time series anomaly detection algorithms perform with decentralized data storage and privacy protection through federated learning. To study this, we conduct a federated time series anomaly detection benchmark, named FedTADBench, which involves five representative time series anomaly detection algorithms and four popular federated learning methods. We would like to answer the following questions: (1)How is the performance of time series anomaly detection algorithms when meeting federated learning? (2) Which federated learning method is the most appropriate one for time series anomaly detection? (3) How do federated time series anomaly detection approaches perform on different partitions of data in clients? Numbers of results as well as corresponding analysis are provided from extensive experiments with various settings. The source code of our benchmark is publicly available at https://github.com/fanxingliu2020/FedTADBench.

translated by 谷歌翻译

Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann , Annika Reinke , Vivienn Weru , Minu Dietlinde Tizabi , Fabian Isensee , Tim J. Adler , Patrick Godau , Veronika Cheplygina , Michal Kozubek , Sharib Ali

分类：计算机视觉 | 机器学习

2022-12-16

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.

translated by 谷歌翻译

TGAVC: Improving Autoencoder Voice Conversion with Text-Guided and Adversarial Training

Huaizhen Tang , Xulong Zhang , Jianzong Wang , Ning Cheng , Zhen Zeng , Edward Xiao , Jing Xiao

分类：人工智能

2022-08-08

非平行的多与众不同的语音转换仍然是一项有趣但具有挑战性的语音处理任务。最近，基于有条件的自动编码器的方法AutoVC通过使用信息限制的瓶颈来删除说话者身份和语音内容，从而实现了出色的转换结果。但是，由于纯粹的自动编码器训练方法，很难评估内容和说话者身份的分离效果。在本文中，一个新颖的语音转换框架，名为$ \ boldsymbol t $ ext $ \ boldsymbol g $ uided $ \ boldsymbol a $ utovc（tgavc），提议更有效地将内容和音色与语音分开，其中预期的内容嵌入其中根据文本转录生产的旨在指导语音内容的提取。此外，对对抗性训练将用于消除从语音中提取的估计内容中的说话者身份信息。在预期内容嵌入和对抗培训的指导下，对内容编码器进行了培训，以从语音中提取嵌入说话者的内容。 Aishell-3数据集的实验表明，所提出的模型在自然性和转换语音的相似性方面优于AUTOVC。

translated by 谷歌翻译

Towards Large-Scale Small Object Detection: Survey and Benchmarks

Gong Cheng , Xiang Yuan , Xiwen Yao , Kebing Yan , Qinghua Zeng , Junwei Han

分类：计算机视觉

2022-07-28

随着深度卷积神经网络的兴起，对象检测在过去几年中取得了突出的进步。但是，这种繁荣无法掩盖小物体检测（SOD）的不令人满意的情况，这是计算机视觉中臭名昭著的挑战性任务之一，这是由于视觉外观不佳和由小目标的内在结构引起的嘈杂表示。此外，用于基准小对象检测方法基准测试的大规模数据集仍然是瓶颈。在本文中，我们首先对小物体检测进行了详尽的审查。然后，为了催化SOD的发展，我们分别构建了两个大规模的小物体检测数据集（SODA），SODA-D和SODA-A，分别集中在驾驶和空中场景上。 SODA-D包括24704个高质量的交通图像和277596个9个类别的实例。对于苏打水，我们收集2510个高分辨率航空图像，并在9个类别上注释800203实例。众所周知，拟议的数据集是有史以来首次尝试使用针对多类SOD量身定制的大量注释实例进行大规模基准测试。最后，我们评估主流方法在苏打水上的性能。我们预计发布的基准可以促进SOD的发展，并产生该领域的更多突破。数据集和代码将很快在：\ url {https://shaunyuan22.github.io/soda}上。

translated by 谷歌翻译

Cross-Modal 3D Shape Generation and Manipulation

Zezhou Cheng , Menglei Chai , Jian Ren , Hsin-Ying Lee , Kyle Olszewski , Zeng Huang , Subhransu Maji , Sergey Tulyakov

分类：计算机视觉

2022-07-24

创建和编辑3D对象的形状和颜色需要巨大的人类努力和专业知识。与3D接口中的直接操作相比，诸如草图和涂鸦之类的2D交互对用户通常更自然和直观。在本文中，我们提出了一个通用的多模式生成模型，该模型通过共享的潜在空间耦合2D模式和隐式3D表示。通过提出的模型，通过简单地通过潜在空间从特定的2D控制模式传播编辑，可以实现多功能3D生成和操纵。例如，通过绘制草图来编辑3D形状，通过绘画颜色在2D渲染上重新色彩，或者在一个或几个参考图像中生成特定类别的3D形状。与先前的作品不同，我们的模型不需要每个编辑任务进行重新训练或微调，并且在概念上也很简单，易于实现，对输入域移动的强大，并且可以在部分2D输入中进行多样化的重建。我们在灰度线草图和渲染颜色图像的两种代表性2D模态上评估了我们的框架，并证明我们的方法可以通过以下2D模态实现各种形状的操纵和生成任务。

translated by 谷歌翻译

Snow Mask Guided Adaptive Residual Network for Image Snow Removal

Bodong Cheng , Juncheng Li , Ying Chen , Shuyi Zhang , Tieyong Zeng

分类：计算机视觉

2022-07-11

在恶劣天气下的图像修复是一项艰巨的任务。过去的大多数作品都集中在消除图像中的雨水和阴霾现象。但是，雪也是一种极为普遍的大气现象，它将严重影响高级计算机视觉任务的性能，例如对象检测和语义分割。最近，已经提出了一些用于降雪的方法，大多数方法直接将雪图像作为优化对象。但是，雪地点和形状的分布很复杂。因此，未能有效地检测雪花 /雪连胜将影响降雪并限制模型性能。为了解决这些问题，我们提出了一个雪地掩模的自适应残留网络（SMGARN）。具体而言，SMGARN由三个部分组成，即Mask-Net，Guidance-Fusion Network（GF-NET）和重建-NET。首先，我们构建了一个以自像素的注意（SA）和跨像素的注意（CA），以捕获雪花的特征并准确地定位了雪的位置，从而预测了准确的雪山。其次，预测的雪面被发送到专门设计的GF-NET中，以适应指导模型去除雪。最后，使用有效的重建网络来消除面纱效果并纠正图像以重建最终的无雪图像。广泛的实验表明，我们的SMGARN数值优于所有现有的降雪方法，并且重建的图像在视觉对比度上更清晰。所有代码都将可用。

translated by 谷歌翻译

I^2R-Net: Intra- and Inter-Human Relation Network for Multi-Person Pose Estimation

Yiwei Ding , Wenjin Deng , Yinglin Zheng , Pengfei Liu , Meihong Wang , Xuan Cheng , Jianmin Bao , Dong Chen , Ming Zeng

分类：计算机视觉 | 人工智能

2022-06-22

在本文中，我们介绍了人际内和人际关系网络（I^2R-NET），以进行多人姿势估计。它涉及两个基本模块。首先，人类内部关系模块在一个人身上运行，旨在捕获人类内部依赖性。其次，人际关系模块考虑了多个实例之间的关系，并着重于捕获人间的相互作用。人际关系间的关系模块可以通过减少特征图的分辨率来设计非常轻巧，但学习有用的关系信息以显着提高人类内部关系模块的性能。即使没有铃铛和哨子，我们的方法也可以竞争或胜过当前的比赛获胜者。我们对可可，人群和ochuman数据集进行了广泛的实验。结果表明，所提出的模型超过了所有最新方法。具体而言，所提出的方法在众群数据集上达到了77.4％的AP和Ochuman数据集上的67.8％AP，从而超过了现有方法的大幅度优于较大的利润率。此外，消融研究和可视化分析还证明了我们的模型的有效性。

translated by 谷歌翻译

Cervical Glandular Cell Detection from Whole Slide Image with Out-Of-Distribution Data

Ziquan Wei , Shenghua Cheng , Jing Cai , Shaoqun Zeng , Xiuli Liu , Zehua Wang

分类：计算机视觉

2022-05-29

颈腺细胞（GC）检测是计算机辅助诊断宫颈腺癌筛查的关键步骤。精确识别宫颈涂片中的GC是挑战的，其中鳞状细胞是主要的。在整个涂片线索中，广泛存在的分布（OOD）数据可降低机器学习系统用于GC检测的可靠性。尽管，最新的（SOTA）深度学习模型可以胜过感兴趣的预选区域中的病理学家，但是当面对这样的吉吉像素整个滑动图像时，质量假阳性（FP）预测仍无法解决。本文提出了一种基于GC的形态学知识，试图通过八邻居中的自我发项机制来解决FP问题的新极性知识。它估计了GC核的极性方向。作为插件模块，Polarnet可以指导一般对象检测模型的深度功能和预测的置信度。在实验中，我们发现基于四个不同框架的通用模型可以在小图像集中拒绝fp，并将平均精度（地图）的平均值增加$ \ text {0.007} \ sim \ sim \ text {0.015} $，其中平均最高超过了最近的宫颈细胞检测模型0.037。通过插入极地，部署的C ++程序在从外部WSI的前20个GC检测准确性上提高了8.8％，同时牺牲了14.4 s的计算时间。代码可在https://github.com/chrisa142857/polarnet-gcdet中找到

translated by 谷歌翻译

A hybrid data driven-physics constrained Gaussian process regression framework with deep kernel for uncertainty quantification

Cheng Chang , Tieyong Zeng

分类：机器学习

2022-05-13

Gaussian process regression (GPR) has been a well-known machine learning method for various applications such as uncertainty quantifications (UQ). However, GPR is inherently a data-driven method, which requires sufficiently large dataset. If appropriate physics constraints (e.g. expressed in partial differential equations) can be incorporated, the amount of data can be greatly reduced and the accuracy further improved. In this work, we propose a hybrid data driven-physics constrained Gaussian process regression framework. We encode the physics knowledge with Boltzmann-Gibbs distribution and derive our model through maximum likelihood (ML) approach. We apply deep kernel learning method. The proposed model learns from both data and physics constraints through the training of a deep neural network, which serves as part of the covariance function in GPR. The proposed model achieves good results in high-dimensional problem, and correctly propagate the uncertainty, with very limited labelled data provided.

translated by 谷歌翻译

What is Event Knowledge Graph: A Survey

Saiping Guan , Xueqi Cheng , Long Bai , Fujun Zhang , Zixuan Li , Yutao Zeng , Xiaolong Jin , Jiafeng Guo

分类：机器学习 | 人工智能 | 自然语言处理

2021-12-31

除了以实体为中心的知识之外，通常组织为知识图（千克），事件也是世界上的必不可少的知识，这触发了活动以kg（ekg）等事件为中心的知识表示形式的春天。它在许多机器学习和人工智能应用中起着越来越重要的作用，例如智能搜索，问答，推荐和文本生成。本文提供了历史，本体实例和应用视图的ekg综合调查。具体而言，要彻底地表征EKG，我们专注于其历史，定义，架构归纳，获取，相关代表图形/系统和应用程序。其中研究了发展过程和趋势。我们进一步总结了透视方向，以促进对EKG的未来研究。

translated by 谷歌翻译